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Abstract 

\ The importance of the H — > ZZ M "golden" channel was shown by its major role in the 

1^ 

pi I discovery, by the ATLAS and CMS collaborations, of a Higgs-like boson with mass near 125 GeV. 

D. 

• We analyze the discrimination power of the matrix element method both for separating the signal 

from the irreducible ZZ background and for distinguishing various spin and parity hypotheses 
describing a signal in this channel. We show that the proper treatment of interference effects 
associated with permutations of identical leptons in the 4e and 4/i hnal states plays an important 
role in achieving the best sensitivity in measuring the properties of the newly discovered boson. We 
provide a code, MEKD, that calculates kinematic discriminants based on the full leading order 
matrix elements and which will aid experimentalists and phenomenologists in their continuing 
studies of the H ZZ — > channel. 



* I Corresponding author: gainer@phys.ufl.edu 
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I. INTRODUCTION 



The CERN LHC collaborations recently reported the observation of a new bosonic particle 
with mass m ~ 125 GeV . The production rates in the main discovery channels are 
consistent with the expectations for the Higgs boson, H, of the Standard Model (SM). 
The most useful channels for the discovery of the Higgs-like boson and early measurements 



of its properties are — > 77 



m 



and H ^ ZZ ^ 



6|-|9|. Each of these discovery 



channels has strengths and weaknesses. For example, observation of the appropriate excess 



in diphoton events immediately implies that the discovered resonance is not spin one 



10 



At the same time, because of the much larger background, measuring the exact properties 



12 



8|. 



of the object in 77 events will be quite challenging 

In contrast, the "golden channel" H — > ZZ — )■ 44j offers the opportunity for clean 
measurements of the mass, spin, parity, etc. of the new resonance in a controlled environment 
with low backgrounds. Furthermore, the four lepton final state allows experiments to probe 
the polarization of the intermediate Z bosons through angular correlations. Thus, most 
of previous theoretical work on spin and parity discrimination has concentrated on this 



channel 



12 



16 



19 



35| : see also 36|-|40j. In fact, CMS and ATLAS have already shown that 



the interpretation of this resonance as CP odd is strongly disfavored, using information from 
this channel 

At the same time, relatively little theoretical effort has gone into using the kinematics 
of four leptons, besides their invariant mass, for discriminating between the Higgs signal 



30 



31 



4l| . Discriminating 



and the SM background in this channel; see, however 
between signal and background is an important issue, though, whether for determining the 
significance of the signal in this channel or for accurately measuring the spin and parity 
properties of the resonance. This is because for an SM Higgs boson with mass near 125 
GeV, the rate of background events in comparison with that of the signal is not negligible; 
in fact, the signal to background ratio for a four lepton mass window of ±2crm^^ is roughly 
2:1 |8|,l9|. (Here, o'm4(. stands for the experimental four lepton mass resolution.) 

Another motivation for revisiting the previous work on this channel is that much of the 
previous literature is limited to the case of a heavy Higgs boson, with mass above the ZZ 
^ Throughout this paper we shall use the convention that Z stands for both on-shell and off-shell Z bosons, 

as well as 7* when allowed, while Z* is used for either an off-shell Z or for 7*. We never make the 

approximation that a Z is on-shell. 



threshold, where both Z bosons produced in the Higgs decay will be on-shell. Most previous 
studies of this channel in the low Higgs boson mass range have lacked diagrams involving 
permutations of identical leptons, and hence the associated interference, in the 4e and 
final states. Such permutations are relatively unimportant when both Z bosons are mostly 
on-shell. However, as we show in this paper, the inclusion of these interference effects plays 
an important role in achieving the best possible sensitivities in the low mass range and is 
crucially important for separating spin zero and spin two resonance hypotheses. 

One of the great advantages of the golden channel is that the final state is fully re- 
constructed and well-measured. However, the presence of four leptons in the final state 
means that, at leading order, there are eight independent observed degrees of freedom 
(in the Higgs CM frame), not counting the irrelevant azimuthal orientation of the event. 
The existence of eight mea ning ful kinematic variables strongly motivates the use of a Mul- 



tivariate Analysis (MVA) 



(MEM) 



16 



30 



31 



41 



43 



technique, and in particular the Matrix Element Method 
45|, thereby allowing all of the information in each event to be 
used in either distinguishing signal from background or in distinguishing between differ- 
ent signal spin/parity hypotheses. Both the MEM and other MVA approaches presumably 
achieve an approximately identical level of discrimination. However, the MEM has several 
advantages over other MVAs. The quantity used in the MEM analysis is the (squared) 
matrix element, which is (up to approximations used) uniquely defined from first principles, 
while MVA-based discriminants require an ad hoc training on very large Monte Carlo sam- 
ples. Since the matrix element has a clear, well-understood physical meaning, one is able to 
make a direct connection between the features of a statistical analysis and the underlying 
physics. This is particularly true in the case of the golden channel, as the distributions of 
kinematic variables can be determined from the amplitudes for producing Z bosons with 

nn 

given helicities [46|, |47[ . 

In contrast with even the recent past, there are now a number of commonly used and 



well-tested programs which can be utilized to calculate matrix elements automatically 



48|. 



These include tools to generate model files from an arbitrary Lagrangian, such as Feyn- 



RULES [49| and LanHEP |50[, as wel 



model files, such as MadGraph [51 



as tools to calculate the matrix elements using these 



52|, CalcHEP |53|, and CompHEP y,|55|. Taken 



together, these tools allow for relatively automatic implementation of new models and auto 



matic generation of matrix elements, as explained at the TASI-2011 summer school 



56| and 



the MC4BSM-2012 workshop 48|, |57|. The existence of such tools considerably simplifies 
calculations of leading order matrix elements for any desired process and strongly motivates 
the use of the MEM where possible. It is this approach that we take in the presented paper. 



In Sec. [ni we review the MEM and describe the associated variable, the kinematic dis- 
criminant KD, which quantifies the extent to which a particular event is described by one 
hypothesis as opposed to another. We also show the superiority of the MEM over analyses 
involving fewer variables. Next, we calculate KD for signal and background events u sing 



different tools: MadGraph |52l] in Sec. [1111 and CalcHEP |53| and NLOME |58|, [59 1 
in Appendix \M In Sec. IIVI we compare the results for the kinematic discriminant com- 
puted with these three tools and show that they are in agreement. We also analyze the 
relevance of interference in the 4e and 4/i final states in the context of SM Higgs boson 
signal vs. background separation. In Sec. |Vl we discuss the added benefit from incorpo- 
rating experimentally-known information about the initial state into the analysis. Sec. IVII 
quantifies the effect of the spin and CP of the resonance on the ability to separate signal 
from background. 

In Sec. IVIII we demonstrate the application of the MEM to discrimination between dif- 
ferent spin/parity signal hypotheses. We show that properly taking into account the above- 
mentioned interference effects in the 4e and 4/i final states significantly enhances the sepa- 
ration power between alternative signal hypotheses. Sec. IVIIII summarizes our findings and 
outlines directions for future work. The notation for the relevant kinematic variables in the 
Higgs golden channel are given in Appendix [Bl 

In order to facilitate future MEM-based studies of the properties of the newly discovered 
four lepton resonance by the experimental and theoretical communities, we are making 
public one of the two new KD codes used in this paper (the MADGRAPH-based code, 
which we call MEKD). The instructions on how to install and run the code are given in 
Appendix O This code calculates matrix elements and kinematic discriminants, natively 
including diagrams with swapped identical leptons (and the associated interference) in the 
4e and 4/i final states, background diagrams with 7* propagators for the "doubly resonant" 



qq — i- ZZ — 7- M process and singly resonant gg — > Z — )• 4£ production [60|. It does not use 



a narrow width approximation for either the Z bosons or the signal resonance. 



II. PRELIMINARIES 



We now define the MEM and note some modifications to the general procedure which are 
useful in studying the golden channel. We then describe a practical method for displaying 
the sensitivity of different kinematic discriminants. Finally, we demonstrate the increase in 
discrimination power that may be obtained using the MEM, in comparison with analyses 
which use fewer variables. 



A. The matrix element method and its kinematic discriminant KD 

For a four lepton event described by kinematic information p, the best possible discrimi- 
nant between two production hypotheses A (e.g. signal) and B (e.g. background), is the ratio 
of the probabilities to observe such an event, given the alternative production hypotheses: 

DiA;B) = ^M^. (1) 

Here, P(p | A) and P(p | B) arc the probability density functions {pdfs) for observing the 
event in the cases of hypotheses A and B, respectively. Note that the alternative definition 
of the discriminant D: 

may seem to be more suited for the case when the alternative hypotheses are signal+background 
or background- only. However, D is a monotonic function of D and, hence, considering D 
rather than D does not add or subtract any information. 

In particle physics, probabihties -P(p) can be represented by the differential cross section 
of a signal or background process with respect to the variables considered, da /dp, scaled 
by the instrumental efficiencies e(p), and normalized by the appropriate total cross section 
within the instrumental acceptance: 

^ da/dp e(p) 
Jda/dpe{p)dp 

Following the same logic of comparing discriminants D and D, one can see that the nor- 
malization constants in the definitions of P(p) do not matter and can be dropped; the 
instrumental efficiencies appearing in the numerator and denominator cancel out; and one 



arrives at the following definition of the kinematic discriminant between two chosen processes 
A and B, e.g. signal and background: 

KD{A- B)=l4 P^) ■ (4) 



das/dp^ 

We use log of the ratio only for technical convenience; this is dictated by the large dynamic 
range in the ratio of cross sections involved. If all signal and background processes involve 
effectively massless initial state partons (as is often the case), the phase space factors are 
identical for all processes and hence cancel in the ratio. Thus, we find that 



KD{A;B) = In 



/ Efaixi)-Mx2)-\MAia + b^Ai)f \ 

a,b 

. E fa'{x,)-Mx,)-\MB{a' + b'^U)\\ 

\a',b' / 



(5) 



In this equation, a and b {a' and b') stand for different types of partons which can produce 
the four lepton final state via process A (B) with corresponding matrix elements AiA ( -Mb)- 
The momenta of the partons are fully defined by the final state and are denoted by Xi and X2 
in units of the initial proton energy. The functions f{x) are the parton distribution functions 
(PDFs), with the subscripts indicating which types of partons they correspond to. 

The products of PDFs in the numerator and denominator of Eq. ([5]) in general do not 
cancel. In particular they do not cancel if hypothesis A is gg H ZZ 4i and 
hypothesis 5 is gg — )■ ZZ* — )■ Ai. Note that Eq. (j5]) is invariant under any changes of 
variables, as any resulting Jacobian would be common to both the signal and background 
expressions and hence would cancel in the ratio. 

One may wish to ignore the information related to the momentum distribution of the 
initial state partons inside the proton and effectively consider only the information encoded 
in the decay itself. Then the kinematic discriminant ([5]) simplifies to become: 

KD(A,B) = J^^1^^^±^^]. (6) 

In Sec. lYl we explicitly analyze whether this simplification results in a substantial loss in 
our ability to discriminate signal from background. 

There is an experimental complication that we have ignored so far, namely that the 
momenta of the final state leptons are not perfectly measured. Generally, one can take this 
into account by including an integration over transfer functions for the final state momenta 

8 



in both the numerator and denominator of Eqs. ([5]) or ([6]); these integrals do not cancel in 
the ratio. The transfer functions would need to include parameterizations of momentum 
mismeasurements and reconstruction efficiencies in the entire phase space relevant for the 
analysis. Since leptons are well-measured, one can use delta function transfer functions 
for many applications involving leptons in the final state. However, in the case in which 
the signal process is the production of a narrow resonance, the value of the signal matrix 
element will be very sensitive to small perturbations in the measured invariant mass when 
the instrumental mass resolution is similar to or greater than the resonance width. This 
is the case for the channel we consider; the width of a 125 GeV SM Higgs boson is only 
about 4 MeV 6l|, while the instrumental four lepton mass resolution is in neighborhood of 



1 GeV 6|-|9|]. To avoid complications associated with using transfer functions, we employ the 
approach used in the CMS H — t- ZZ* — )■ Ai analyses {3, IJ • In this approach, the four lepton 
mass information is factorized from the rest of the kinematic information by calculating the 
signal matrix element for ttih = fn^e^^ where mf/ is the observed four lepton mass of a given 
event: 

\MHia + b^4:i)f = \MHigg ^ H ^ MlniH = mlf)\\ (7) 

This matrix element can be used both in Eqs. ([5]) and ([6]). The instrumental four lepton 
mass resolution is then encoded in 2D-p(i/'s: 

• for signal: pdjuiruu, KD \ itlh) = pdjuimu \ mn) ■ pdjuiKD \ m^i); 

• for background: pdfzzirnu, KD) = pdfzzirnu) ■ pdfzz{KD \ m^e), 
which are ultimately used for the construction of the test statistic itself. 



B. Event generation 



For the analyses described below, we use parton-level events generated with MadGraph 



52 



62 



63| . Both signal and background events were generated within the four lepton mass 



window 120 < < 130 GeV. When generating the signal, we use a nominal Higgs boson 
mass of rriH = 125 GeV. For background ZZ events, only processes initiated by the first 
two generation quarks were generated. The lepton acceptance cuts were 



Pt > 7 GeV and |?7| < 2.5 for electrons. 



• Pt > 5 GeV and \ri\ < 2.4 for muons. 



Events are generated for each possible choice of final state flavors, namely e~e^fi~fi'^ ( "differ- 
ent favor", DF), and e~e~^e~e~^ and ("same flavor", SF). In the case of DF events, 
opposite-charge lepton pairs are formed by flavor {e~e~^){fi~fi~^). In the SF case, two differ- 
ent pairings of opposite-charge leptons are possible, e.g. (e5'e^)(eje4 ) and (e]~e4 )(eje^). 
The invariant mass of a pair closest to the mass of Z boson is mzi {^12) in CMS (ATLAS) 
notations, while the invariant mass of the other pair is mz2 {^34) ■ We accept the event 
if at least one pairing of leptons passes the invariant mass cuts of 40 and 12 GeV on rrizi 
(mi2) and mz2 {^34), respectively. We do not perform any detector simulation as our pri- 
mary concern is with the comparison of the different analyses as opposed to the absolute 
performance of any given analysis. 



C. ROC curves 



We wish to characterize the extent to which our KD is useful in enhancing the sensi- 
tivity of an analysis. One can generate pseudoexperiments using the two-dimensional pdfs 
(involving KD and invariant mass m^i) described above and perform a statistical analysis on 
how well two hypotheses can be separated. In fact, we perform such a prototypical analysis 
in Sec. IVIII However, a simpler and more intuitive means of displaying the extent to which 
signal and background may be separated is provided by "Receiver Operating Characteristic" 



(ROC) curves [64 1- 



Using generated events (see previous subsection), we build distributions of KD{A; B) for 
processes A and B that we wish to compare. Then, we calculate cumulative probabilities 
eA{KD > KDcvt) and eeiKD > KDcvt)- The plot of eA{KD > KDqvt) versus 
esiKD > KDqu'y), as one smoothly increases the cut value KDqut, is the ROC curve. 
The curve gives the fraction of process A events that pass a given cut on KD in terms of 
the fraction of process B events passing the same given cut. The further the curve is from 
the diagonal (e^ = e^), the better the cut has separated the two processes. 

10 



D. Comparison with single variable analyses 



The most sensitive kinematic variable for distinguishing signal and background is, of 
course, the invariant mass of the event. However, if we want to move beyond this and obtain 
greater sensitivity, a natural question is whether we truly need to use all of the kinematic 
variables in our analysis, or whether using one or two particularly sensitive variables would 



suffice. We therefore investigate the angular variables 9* and $, in the convention of Re: 



as well as m^2, the mass of the less massive opposite-charge dilepton pair 65|. We 



s. H 



choose mz2 in particular, as due to the Z7* contribution to the background, the signal and 
background shapes are significantly different. The ZZ background has a substantial fraction 
of events with mz2 values close to the cut value of 12 GeV, while for a Higgs boson decaying 
to actual Z bosons at tree level, the low values of mz2 are substantially less likely (due to the 
loop suppression, the H Z7* decay can be neglected). The motivation for considering 
the angle 9* , which is the Z boson polar angle in the center of mass frame of the event, 
is that the background 9* distribution is much more peaked in the forward and backward 
directions (in comparison with signal). This is due to the t-channel nature of the qq — ?■ ZZ 
processes. The angle the angle between the two dilepton decay planes, has often been 
considered in the literature for distinguishing between spin and parity hypotheses (see, e.g.. 



Refs. 



23|). 



Using simulated SM Higgs boson and ZZ background events, we construct numerical pdfs 
Ps{x) and Pb{x) for each of these variables (here x stands for either mz2 or 9* or $). We then 
take the ratio of these probabilities to construct discriminants and produce corresponding 
ROC curves. These three ROC curves are shown in Fig. [H along with the ROC curve 
constructed using the MEM KD{H; ZZ) calculated using the MadGraph matrix element 
(see Sec. Illll for details). One can see that the analysis utilizing KD{H; ZZ) based on the 
complete matrix element is the most sensitive. We also note that 'mz2 is very sensitive in its 
own right. This observation will become important in Sec. IVH where we discuss the signal 
vs. background separation when assuming different signal hypotheses. 

11 
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Background fraction 

FIG. 1. Comparison of different ROC curves based on: the value of KD obtained from matrix 
elements (red curve), mz2 (blue curve), ^ (magenta curve), and 6* (black curve). The black dashed 
diagonal line is the ROC curve obtained from cutting events indiscriminately (e.g., by flipping a 
fair coin or by only considering some fraction of the data set). 



E. Overview of available tools for calculating KD 



Since an event generator performs a Monte Carlo integration of a matrix element squared 
over the relevant phase space, in principle any event generator can be used to calculate the 
matrix element, given the final state kinematics. Unfortunately, not all event generators 
have implemented this functionality. 



In this paper, we shall use MadGraph [51|, 1521], and CalcHEP 531], which can easily 
provide analytical expressions for the matrix element for the relevant 2 — i- 4 scattering 
processes, e.g, gg ^ H ^ ZZ* — ?■ 4i for signal and qq — > ZZ* — > Ai for the background. 
MadGraph and CalcHEP are on a relatively equal footing — they are both highly 
automated and very flexible, and thus provide a useful cross-check on each other (this test 



12 



is performed in Appendix Using the built-in flexibility of these generators, we construct 
a higher-level package, MEKD (Sec. IIIII) . that calculates complete leading order matrix 
elements for gg X ^ 4i and qq — )■ Ai for any given four lepton final state. This package 
was used along with the MELA package, to be described below, by CMS in their most 
recent update on the observation of the newly discovered boson decaying to ZZ Ai and 
the measurements of its properties in this channel ^|. 

As an alternative to the automated tools, one could utilize a specialized package, perhaps 
because it has the expressions for the relevant matrix elements readily available or because 
it offers extra functionality not accessible in the automated tools. Two examples of this type 
include NLOMEf jsQ], which is based on the MCFM event generator jsS], and JHUgen 

NLOME over all other competitors is its next-to-leading 



|66l |. The crucial advantage o: 
order (NLO) functionality j43|. 



44| . On the other hand, the H — ?■ ZZ — 4i channel is built 



around the matrix element for the DP e~e~^fi~fi~^ final state. Therefore, when applying it 
to events with the SF 4e or 4/i final states, the interference effects are missed. Also, signals 
of alternative spins and parities are not available in this package. JHUgen allows one to 
extract the leading order matrix element for a broad spectrum of possible signal hypotheses, 
but does not calculate the matrix element for backgrounds. 

There is yet another option: one may wish to code one's own software package using 



expressions for matrix elements found in the literature, such as those in Ref. |16l . |30| for 
signal and Ref. 4jJ, |45| for background. This approach was used by both CMS and ATLAS. 
Although the corresponding software packages used by the two experiments are not publicly 
available, for completeness we describe their main features, derived from the references the 
codes are based on. 

In their new boson discovery paper |7[jthe CMS collaboration used the MELA package 
which is largely based on the work in 16|, l30|. For an event with four lepton mass 
above the on-shell ZZ threshold, the signal and background probabilities for the event, 
Ph( O I rr ivi = niii) and Pzz(f2|m4f), were calculated analytically using equations from 
Refs. 30|, |4l|. Here, the vector denotes five angles describing the four lepton system in 
its center of mass frame (see Appendix [B]). For an event with four lepton mass below the 
on-shell ZZ threshold, the dilepton masses mzi and mz2 niust be included in the matrix 
element calculations. The signal probability Pi{{'mzi,mz2,^\^}i = ^ai) was calculated 



We thank J. Campbell and C. Williams for making the beta version of the NLOME code available to us. 
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analytically [16] , while the background probability was tabulated from template distributions 
for trizi, ^Z2i and the five angles, obtained with the POWHEG simulation at generator 
level [T], |67| . The analytic computations of the matrix elements did not include same- type 



lepton permutations and associated interferences effects in the SF four lepton final states. 

In the recently released H — )■ ZZ — )■ M analysis update by CMS js], the MELA package 
was upgraded and now includes the background probability Pzz( "^zi, w^z ?, ^ | ) for the 
low mass range calculated analytically using parameterizations from Ref. 45|]. The permu- 
tation/interference effects for the SF four lepton final states are still ignored. In the same 
Ref. [8|, CMS also used the MEKD package, which has complete leading order matrix el- 
ements for signal and background, including the proper treatment of lepton permutations 
and the associated interference present in SF four lepton final states. 

In their recent update on the H — > ZZ — > 4£ channel [9j, ATLAS also used the matrix 
element method for testing various spin-parity hypotheses for the newly discovered boson. 
In these analyses, the kinematic discriminants are built solely from signal matrix elements 



based on Ref. 



16|. Hence, the constructed kinematic discriminants do not account for 



permutations of identical leptons and the associated interference. Ignoring the presence of 
background in these MEM-based analyses also makes them even less optimal. 

Given that the permutation/interference effects have not always been consistently imple- 
mented in the past, a large part of the results and discussion presented below are focused 
on the investigation of the effects from interference on the physics results. We show that 
in some cases the interference effects play a significant role and should not be neglected in 
separating alternative hypotheses describing the observed excess of events near the mass of 
125 GeV. 



III. MATRIX ELEMENT KINEMATIC DISCRIMINANT (MEKD) 



The MEKD code is a publicly available package 68|; the instructions for download and 
usage are provided in Appendix O This code uses MadGraph to calculate complete LO 
matrix elements for signal gg ^ X ^ ZZ — 4£ and background process qq — ?■ ZZ — >■ 
4i and also builds the kinematic discriminants according to Eqs. and (jH]). The SM 
Lagrangian and the non-SM additions are implemented automatically into MadGraph via 



FeynRules 



49| 
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The MEKD calculations are validated twice: first, by comparing matrix elements calcu- 
lated by MadGraph and CalcHEP, both embedded into the same MEKD framework; 
and, second, by comparing the matrix elements calculated within the MEKD framework 
with the LO matrix elements calculated by the standalone NLOME package. The Mad- 
Graph vs. CalcHEP validation is performed for all three final states, 4e, 4/i, 2e2fi, 
separately, while MadGraph vs. NLOME validation is performed only for the 2e2/i final 
state (the version of NLOME available to us did not include interference for the 4e and A/j, 
final states). The results are found to be identical for all practical purposes; the details can 
be found in Appendix [XI 

In the current version of MEKD, the SM Lagrangian is extended to allow for non-SM 
couplings of a spin zero or spin two resonance to gluons and to Z bosons. To be specific. 



following 

resonance: 



16 



30|, we implement the following terms in the Lagrangian for a spin zero 



Chzz ^ -^-fHZ^Z^ - ^-^HZ^^Z^- - ^^Z^^Z^^ {dp d'^H) - ^HZ.J'^'', (8) 
J^Hgg 9 -^HF'^F^'^' - ^HF^^F'''^'' - [d^ d"H) - ^HF^^F'''^"' , (9) 



where H is generic scalar field, Z{F)'^ is the vector potential for the Z boson (gluon), and 
Z{FY'^ is the field strength tensor for the Z boson (gluon). For the specific case of the SM 
Higgs boson, one has 

(giz, 92z, 93z, 9az) = ( 0' 0, j , (10) 

{gig, 92g, gSg, Qig) = (0, fi'ofr,0,o) • (H) 

Another hypothesis which we will consider below is a CP odd spin zero resonance, for which 
we have 

{giz, 92z, 93z, 9az) = (0, 0, 0, g'^g z) , (12) 

{gig, 92g, 93g, 9ig) = (O, 0, 0, ^f^gg) , (13) 

where the product of g'^g ^ 9es g is chosen so that we obtain the same total cross section 
times branching ratio as in the case of the SM Higgs boson. Of course, for a spin zero 
resonance, no angular distributions depend on which particular couplings with gluons are 
non-vanishing, at least at leading order. 

15 



In the case of a general spin two resonance, we have 

+h,h^,d^Z^^d''Z^^ + h^h^^Z^^Z'' + h,{d^h^,)Z''d^Z^ + kj^h^^d^Z-d^Z^ 
+h,h^,d^Z''^d''Z^p + fc9.(5a/^M")e^""' (^"^-) Zp 

+kioz {dpdM e^^'P'^ [d'^Z.d^Z^ + d^^Z^d^Z,) , (14) 

and an analogous effective Lagrangian Ccgg describing the coupling of an arbitrary spin two 
resonance to gluons. 

In what follows we will consider the hypothesis of a massive graviton [69], for which the 
values of couplings are 

{kiz, k2z, ksz, kiz, k^z, kez, kjz, ksz, kgz, fcioz) = [kiz, 0, 0, 0, -m\kiz, 0, 0, 0, 0, O) , (15) 
{kig, k2g, ksg, k4g, hg, kfig , k^g, ksg, kgg, kiQg) = {kig, 0, 0, 0, 0, 0, 0, 0, 0, 0) , (16) 



(see e.g. Ref. [70|), with the kiz and kig set to give the same total cross section as in the 
SM Higgs boson case. 

The user has complete freedom to adjust any of the couplings in the spin zero and spin two 
Lagrangians described above. The case of a spin one boson and other extended functionality 
are coming shortly. Hence, the MEKD code can be used for general studies of the Higgs-like 
resonance, and in particular for the measurement of these couplings j?!]. 



IV. SEPARATION OF THE STANDARD MODEL HIGGS BOSON AND BACK- 
GROUND 

As discussed in Sec. Ill CI the effectiveness of a kinematic discriminant can be assessed 
using the corresponding ROC curve. Figure [2] shows ROC curves characterizing the per- 
formance of the kinematic discriminant KD{H; ZZ) that is built to sort events based on 
whether they are more likely to be produced via the SM Higgs boson or by ZZ background 
processes. As shown in Appendix |Xl MadGraph and CalcHEP give identical results, so 
for the purposes of this figure we refer to them as MadGraph/CalcHEP. 

Fig. E] (left) shows that the ROC curves for KD{H; ZZ) as obtained with NLOME and 
MadGraph/CalcHEP for 2e2/i events are identical^ as expected from the comparisons 
in Appendix |Al One can see that if one were to use a simple cut on KD as a part of the 

16 




Background fraction Background fraction 



FIG. 2. Comparison of ROC curves obtained from the different implementations of the kinematic 
discriminant: from MadGraph/CalcHEP (red sohd hnes) or from NLOME (blue dotted lines), 
for DF (left) and SF (right) ii events. 

event selection in a SM Higgs boson search analysis, then one could suppress background 
by about a factor of two, while keeping 90% of signal events. 

Fig. |2] (right) shows that the power of the kinematic discriminant to separate signal 
from background in the SF four lepton final states is approximately the same. However, 
a closer look shows a very small difference, (9(1%), between MadGraph/CalcHEP and 
NLOME(MCFM) results. This is due to the fact that NLOME(MCFM) is missing 
permutations of identical leptons and the associated interference present for SF four lepton 
final states. To alleviate the problem somewhat, we recycle the DF matrix element 

((e-e+)(/i>+))|' 

to make a "symmetrized" matrix element for the SF four lepton final states: 

\MsF{e^ete^ei)\l^^^^^^.^^^^ \Mdf {ie^et)ie^et))\^ + \Mdf {ie^et)ie^et))\\ (17) 

which now includes permutations of identical leptons, but, of course, still misses the inter- 
ference terms. 

The effect of neglecting interference is more visible in Fig. |3l where we compare three 
quantities: |A1h|^ (left column), \A4zzf (middle column), and KD{H; ZZ) (right column), 
all calculated in two different ways using MadGraph. The x axis shows the full results for 
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FIG. 3. Comparison of the signal (left column) and background (middle column) matrix elements, 
as well as the kinematic discriminant i^'DMAD (right column), for signal events (top row) and 
background events (bottom row) as calculated with the full matrix element for SF events |A^||j;> 
(including the interference) versus the approximation |-^|£)p(gj^m) obtained by the patch given in 
Eq. (fTT]) . AU quantities are calculated with Mad Graph. 



the SF final state. The y axis shows the results obtained using the patch shown in Eq. ([T7|) . 
Since we use the exact same code, the only difference between x and y values is whether or 
not the interference contributions are taken into account in the corresponding code. Fig. [3] 
shows a fair amount of scatter for the matrix elements themselves as well as for their ratio; 
this explicitly shows the role of interference. 

For the matrix elements, the scatter becomes noticeably worse for low values of the 
squared matrix element. This can be easily understood: for SF events, there are two ampli- 
tudes corresponding to the two ways of pairing the final state leptons. Normally, one pairing 
has one Z on-shell and the other Z off-shell, while the other pairing has both Zs off-shell. 
The result is then dominated by the on-shell amplitude squared and the interference terms 
are negligible. In order for the interference terms to become noticeable, both pairings should 
have two off-shell Zs in which case the overall matrix element squared will be small. We can 
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easily observe this effect, especially for the background matrix element (the middle plots 
in the figure), for which the Z7* amplitude is large, which more easily allows the Zs to be 
off-shell. The larger spread at low |A^ba'gP appears at high KD (right column plots), since 
the KD ratio is inversely proportional to \M.bkg\^- 

The reason why the obvious scatter seen in Fig. [3] has relatively little effect on the ROC 
curve in Fig. [2] (right) is easy to understand. The observed scatter (up to ±0.5 units of KD) 
is substantially smaller than the typical range of KD values (about 4 units). Therefore, 
using the "incorrect" KD does not significantly broaden the signal and background KD 
distributions, and the signal vs. background separation power is not affected too much. The 
similarity of the left and right plots for DF and SF four lepton events in Fig. |2]is yet another 
sign that the relative role of interference effects is not expected to be large in separating 
signal and background. As we discuss in Sec. IVIIt this is not the case when separating 
alternative signal hypotheses. 



V. INCLUDING INFORMATION ABOUT THE INITIAL STATE 

In the Higgs golden channel, the final state is fully reconstructed and thus the momenta 
of the initial state partons may be inferred. In our discussion thus far, we have ignored this 
additional information, so a very relevant question is how much does one gain from consid- 
ering it. To illustrate the idea, in Fig. H] we plot the rapidity distributions for signal and 
background events generated with PYTHIA 725 at the two relevant LHC energies. The 
dashed histograms represent parton-level distributions before cuts. There is a substantial 
difference in the shapes of the signal and the background — the Z bosons in the background 
events are much more forward, because they are produced from an asymmetric qq state, 
where typically the quark carries a larger momentum fraction. In contrast, the signal dis- 
tributions are much more central, since the Higgs boson is produced from a symmetric gg 
initial state, where the gluons are likely to share the momentum more evenly. 

However, these large differences in the event rapidity distributions wash out in the pres- 
ence of cuts (solid histograms in Fig. H]). The leptons produced in the decays of Zs at high 
rapidity are much more likely to fail the lepton acceptance cuts, which shaves off the high 
rapidity tail in the background distributions. As a result, the signal and background rapidity 
distributions become similar, but not identical. It is therefore worth asking how much one 
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FIG. 4. Unit-normalized rapidity distributions of ZZ* events for signal (bluish colors) and back- 
ground (reddish colors). Results are shown for two different LHC energies (7 and 8 TeV), before 
cuts (dashed lines) and after cuts (solid lines). 

can gain by utilizing the residual rapidity differences observed in the figure. 

To this end, in Fig. Elwe compare the ROC curves obtained from KD with (blue lines) 
and without (red lines) PDF information included. In the absence of the lepton acceptance 
cuts (dashed lines), the difference between the ROC curves for the two processes is quite 
significant, as one might have guessed from Fig. |H However, after the cuts (solid lines), 
the curves become quite similar and the advantage of using KD with PDFs (Eq. ([5])) with 
respect to using KD without PDFs (Eq. ([5])) can be quantified as being at the percent level. 



VI. SIGNAL-BACKGROUND SEPARATION FOR NON-SM SIGNALS 

We showed the utility of the MEM, and in particular a tool like MEKD, for distinguishing 
the SM Higgs from the SM irreducible background. However, in general, the efficacy of the 
MEM in separating an arbitrary resonance from the background may depend on the spin 
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FIG. 5. Impact of parton distribution functions on ROC curves. The red curves are based on 
KD{H] ZZ) from Eq. ^ and do not account for the longitudinal boost of the event, while the 
blue curves are based on KD{H; ZZ) from Eq. ([5]) and include the effect from the parton distri- 
bution functions. Solid (dashed) lines are obtained from event samples with (without) the lepton 
acceptance cuts. 



and CP properties of the new state one is searching for. To illustrate this and to quantify the 
importance of this effect for several cases of interest, in Fig. Elwe show ROC curves indicating 
the ability of the MEM KD to separate signal from background for three potential signals: 
SM Higgs boson (J*"^ = 0"*"), CP-odd scalar boson (J*"^ = 0~), and a massive graviton 
(J*-^^ = 2+) in each of the three possible flavor states. We construct kinematic discriminants 
KD{0^; ZZ), KD{0^; ZZ), and KD{2^; ZZ) and apply all of them for each of the signal 
spin/parity samples. 

There are three important lessons that one can derive from Fig. |6l 
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FIG. 6. ROC curves (as defined in Subsection III Cp describing the separation between signal and 
background using MEKD where the signal is SM Higgs (top row), CP odd Higgs (middle row) 
or graviton (bottom row). The final states considered are e^e^/i+/.i^ (left column), e^e~e^e~ 
(middle column), and i^l^ i^i^ fi^ (right column). 



• The optimal separation between signal and background is achieved only when we use 
a kinematic discriminant KD constructed with the correct signal hypothesis. 

• However, the difference between the three discriminants is not dramatic. This is easy 
to understand since all three signal models assume, by construction, that the new 
resonances decay to Z bosons, while the background has a fair amount of the Z7* 
and even some 7*7* intermediate states. As was discussed in Sec. [TTl (Fig- E]), the mz2 
observable carries a lot of weight in the overall discrimination power of KD{X] ZZ), 
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which makes all three signals fairly alike as far as their separation from the background 
is concerned. 

• There are differences between the ROC curves for the e~^e~fi'^fi~ final state and the 
SF final states for 0~ and 2+ signals. This suggests that, compared with the SM Higgs 
boson, the interference effects may play a more important role when the 0~ and 2^ 
hypotheses are considered. This is discussed further in the next section. 

VII. SPIN AND PARITY DISCRIMINATION 

An important and well-studied use of the golden channel is for determining the spin and 
parity of a putative Higgs boson. As we show below, the use of the exact matrix element 
for constructing kinematic discriminants plays a crucial role in achieving the best separation 
between alternative signal hypotheses. As an example, we consider two alternative signal 
models: a pseudoscalar boson (J'~^^ = 0~) and a massive graviton, which is, of course, a spin 
two boson, (J*^^ = 2^). The couplings for the CP-odd scalar and the massive graviton are 
defined in Sec. IIIII To generate SM Higgs boson, pseudoscalar, and spin massive gravitons 
events, we use MadGraph. The kinematic cuts on the leptons are the same as described 
in Sec. IIIBI We then use ROC curves for the kinematic discriminant KD{J'^^; Hsm) to 
quantify the differences in kinematics of the four lepton system for the J*"^ boson and the 
SM Higgs boson (Hsm), as shown in Fig. [3 

There are three important conclusions to be drawn from this figure. First, one can see 
that the separation between the 0~ boson and Hsm is more pronounced in comparison with 
the 2^ boson case. This indicates that one should be able to tell the difference between the 
0~ boson and Hsm sooner than between the 2+ boson and Hsm- Second, there is a significant 
difference between the SF (4e and 4/i) and DF (2e2/i) four lepton final states. This is a clear 
indicator that the allowed permutations of identical leptons and the associated interference 
in the SF events play a significant role. The detriment to sensitivity in neglecting such 
effects in constructing KD will be discussed below. Third, for the 2^ boson, there is more 
expected separation with the SM Higgs for the SF final state as opposed to the DF final 
state, while for the 0~ boson case the opposite is true. Consequently, one should expect 
that the proper treatment of permutations/interference in the construction of KD should 
lead to a greater ability to separate the 2+ boson from the SM Higgs boson in comparison 
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Signal fraction for SPIN 0* Signal fraction for SPIN 0* 

FIG. 7. ROC curves, shown for all three final state flavors (4e, 4^, 2e2fi), display the separation 
of alternative signal hypotheses: (left) the pseudoscalar (J'-'^ = 0^) vs. the SM Higgs boson (0+); 
(right) the massive graviton (J*-"^ = 2+) vs. the SM Higgs boson (0^). 

with the 0^ boson case. 

In order to further elucidate the role of permutations and interference effects, we proceed 
with a more detailed analysis of the four muon final state, (/^r/^i')(/^2^ f^t)- Here, the first 
dimuon pair, (fi^fif), is formed from the pair of opposite-charge muons whose invariant 
mass is closest to the Z boson mass. We calculate three "matrix elements" for this final 
state as follows: 

1. The complete leading order matrix element squared, |A^4^|^, with all permutations of 
identical leptons and associated interference included. The choice of how one pairs 
muons in this case does not matter; the full matrix element does not depend on how 
a human wants to group the final state leptons. 

2. The "symmetrized" matrix element squared |A^2e2/i|fyin that takes into account per- 
mutations of identical leptons, but ignores the associated interference effects: 

2 I 1^1 I 2 

|A^2e2/.|sym = | -^2e2^( {t^lf^t)e {t^2t^t)t^ ) | + | M2e2^^{ {f^lt^t)e {f^2f^t)t^ ) | , (18) 

where the first pair, marked by a subscript e, is treated as if it were a pair of electrons. 

3. The 2e2yU matrix element squared without symmetrization, |A^2e2At|^, that ignores both 
permutations of identical leptons and the associated interference effects: 

\M2e2^,f = I M2e2^^i il^llJ't)e il^2l^t)t^ ) T ' (1^) 
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FIG. 8. Comparison of kinematic discriminants KD(0~; Hsm) for the four muon final states 
calculated with different levels of approximations. In all four plots, the x-axis shows the KD 
calculated using the complete matrix element squared, |A44^[^. In the plots on the left, the y- 
axis gives the KD calculated using |A^2e2/xlsyjni which takes into account permutations of identical 
leptons, but ignores the associated interference effects. In the plots on the right, the y-axis gives 
the KD calculated using |A^2e2^iP) which ignores both permutations of identical leptons and the 
associated interference effects. The top plots are for SM Higgs signal events, while the bottom 
plots are for a pseudoscalar. 

where the first pair, marked by a subscript e, is treated as if it were a pair of electrons. 

We compare the results obtained using each of these three treatments of the SF final 
state in Figs. [8], |9], and [TOl Fig. [8] and Fig. |9] compare the values of KD{3'^^; Hsm)- In 
Fig. [To] the effect is shown through ROC curves, corresponding to those shown in Fig. [71 
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FIG. 9. Comparison of kinematic discriminants KD{2^; Hsm) for the four muon final states 
calculated with different levels of approximations. In all four plots, the x-axis shows the KD 
calculated using the complete matrix element squared, |A^4^|^. In the plots on the left, the y-axis 
gives the KD calculated using |A^2e2/ilsyjn, which accounts for permutations of identical leptons, 
but ignores the associated interference effects. In the plots on the right, the y-axis gives the KD 
calculated using \A42e2fj,\ , which ignores both permutations of identical leptons and the associated 
interference effects. The top plots are for the SM Higgs signal events, while the bottom plots are 
for a massive graviton. 



These figures clearly show that including or ignoring permutations and interference has a 
significant impact on the value of the kinematic discriminant calculated and the separation 
power between alternative signal hypotheses. Comparing Fig. [8] and Fig. |9] (or the right and 
left plots in Fig. [10]) shows unambiguously that the impact on the expected separation power 
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Signal fraction for SPIN 0* Signal fraction for SPIN 0* 

FIG. 10. ROC curves showing the separation of alternative signal hypotheses for the fi'^ fi^ fi^ fi~ 
final state: (left) the pseudoscalar (J^^ = 0~) vs. the SM Higgs boson (0"*"); (right) the massive 
graviton (J*-"^ = 2+) vs. the SM Higgs boson (0"^). The solid blue curves are for KDs that 
use the full expressions for the squared matrix element for the four muon final state. The green 
dashed curves are for KDs that take into account permutations of identical leptons, but ignore the 
associated interference effects. The red dotted curves are for KDs that ignore both permutations 
of identical leptons and the associated interference effects. 



between the 2^ boson and Hsm is much larger than for the case of the boson versus HgM- 

To translate the observed differences in the ROC curves into the expected differences in 
our ability to tell apart different spin-parity hypotheses, we perform a simplified statistical 
analysis using MEKD-based discriminants. As a toy example, using CalcHEP, we first 
calculate the expected number of events (SM Higgs boson, 0~ pseudoscalar, 2+ massive 
graviton, and ZZ background) for an integrated luminosity of 25 fb~^ at 8 TeV, with the 
following lepton selection cuts applied: > 5 GeV, \ri\ < 2.5, nii+i- > 12 GeV, 120 < 
m^i < 130 GeV. The bosons have mass 125 GeV. The signal event count is scaled by K- 
factor 1.9 to match the Higgs boson cross sections used by the LHC experiments. The ZZ 
background event count is scaled up by a factor of 1.5 to approximately account for the 
NLO K-factor and presence of reducible 4i backgrounds, where one or more leptons are not 
prompt. This gives the following signal (background) event counts: 4.0(7.5) for the 4e final 
state, 3.9(7.0) for the 4yU final state, and 8.9(15.5) for the 2e2/i final state. We assume that 
the alternative signal hypotheses have the same cross sections as that of the SM Higgs boson 
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and thus cannot be distinguished from it by using the information on the event yields in the 
four lepton mass distributions alone. We do not attempt to introduce any experimental event 
reconstruction/selection efficiencies. Thus the absolute numbers do not reflect accurately 
the expected separation power attainable by the LHC experiments; we are interested, rather, 
in the relative results when one does or does not include permutations/interference in the 
construction of kinematic discriminants. The statistical analysis is based on 2D pdfs for 
alternative signal+background hypotheses: 

• pdf{x, y I HsM + bkg ) for the standard model Higgs boson with background; 

• pdf{x, y I J*"^ + bkg ) for a signal of an alternative J*"^ hypothesis with background, 

where x = KD(JisM] ZZ) and y = KD{J''^^] ZZ). We build these 2D pdfs using events 
generated with MadGraph. For background, we use the SM qq — > ZZ — t- M process. 
Then we proceed with generation of pseudoexperiments, using the expected event yields 
and the 2D pdfs. The pseudoexperiments are generated for two different signal hypotheses, 
the standard model Higgs boson and a boson with alternative J*"^, in both cases assuming 
the presence of the ZZ background. For each pseudoexperiment, we calculate as our test 
statistic, g, the negative log likelihood ratio: 

„ , C{ pseudoexperiment I J*"^ + bkg ) 

q = — 2 In ^ (2t)j 

£( pseudoexperiment | Hsm + bkg ) 

Figure [11] shows the test statistic distributions for two alternative hypotheses: "massive 

graviton plus background" and "SM Higgs plus background" , represented by open and filled 

histograms respectively. Figure [12] shows the test statistic distributions for alternative signal 

hypotheses: "pseudoscalar plus background" and "SM Higgs boson plus background". The 

three plots in each figure differ in how the 4e and 4yU final states are treated in calculations 

of the matrix elements for signal, A^Hsm ^"^^ -^jcp, and background, A^zz, as follows: 

• (left plot) complete leading order matrix element; 

• (middle plot) permutations of identical leptons in 4e and 4yU final states are accounted 
for, but the associated interference between diagrams is ignored; 

• (right plot) both permutations and interference in 4e and 4/i final states are ignored. 
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FIG. 11. Test statistic distributions for the two alternative hypotheses: "massive graviton plus 
background" and "SM Higgs boson plus background", open and filled histograms, respectively. 
The three plots differ in how the 4e and 4^ final states are treated in calculations of the matrix 
elements: (left) complete leading order matrix element; (middle) permutations of identical leptons 
in 4e and 4/i final states are accounted for, but the associated interference between diagrams is 
ignored; (right) both permutations and interference in 4e and 4^ final states are ignored. The toy 
model used for generating pseudoexperiments is described in the text. The projected hypotheses 
separations are stated on the plots. 

For the case of the massive graviton, there is about a 15% increase in sensitivity associated 
with the correct treatment of the SF four lepton final states. The difference is large and 
may become a decisive factor in the ability of ATLAS and CMS experiments to distinguish 
between spin zero and spin two bosons with the data of the 2011+2012 LHC run. As 
expected for the pseudoscalar case, the gain from the correct treatment of the SF final 
states is smaller, about 3%. 



VIII. SUMMARY AND OUTLOOK 

We have shown that matrix element analyses improve the extent to which various Higgs- 
like signals can be discriminated from the background and allow for the measurement of 
the spin and parity properties of the Higgs-like boson in the X — )■ ZZ* — )■ 4i channel. We 
have demonstrated the importance of using the full matrix element including permutations 
of identical leptons and the associated interference terms for the SF four lepton final state. 
The proper treatment of these effects enhances our ability to distinguish between different 
spin and parity signal hypotheses. A gain in sensitivity as large as 15% can be expected for 
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FIG. 12. Test statistic distributions for the two alternative hypotheses: "pseudoscalar plus 
background" and "SM Higgs boson plus background", represented by open and filled histograms, 
respectively. The three plots differ in how the 4e and 4/i final states are treated in calculations of the 
matrix elements: (left) complete leading order matrix element; (middle) permutations of identical 
leptons in 4e and ifi final states are accounted for, but the associated interference between diagrams 
is ignored; (right) both permutations and interference in 4e and 4^ final states are ignored. The toy 
model used for generating pseudoexperiments is described in the text. The projected hypotheses 
separations are stated on the plots. 



the case of SM Higgs vs. massive graviton hypothesis separation. 

As the ATLAS and CMS experiments move toward precision studies of the properties 
of the observed boson, the kinematic discriminants will play larger and larger roles. Thus, 
the demand will grow for automated tools allowing for calculations of matrix element-based 
discriminants that most accurately capture all features of the underlying physics of the 
different signal and background processes involved. 

The studies presented have been carried out with the MEKD code, which is now publicly 
available. The user of this code has the flexibility to construct signal matrix elements for 
an arbitrary set of allowed couplings for different spin zero and spin two resonances; the 
matrix element for the background process qq — )■ 4i is also a part of the package. The 
general couplings of a spin one boson as well as the ability to consider the final state with 



four leptons and a bremsstrahlung photon will be added shortly [7l| 
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Appendix A: Validations of MEKD against CalcHEP and NLOME 
1. Comparison with CalcHEP 

Following the same procedure as in Sec. Illlt we also created an independent code to 
compute KDq Ahc, where the matrix element is calculated by CalcHEP. As in Sec. Illlt 
the SM is implemented via FeynRules, which ensures that our KDmad and KDq alc 
results are obtained with identical inputs for the SM parameters (masses, couplings, etc.). 
Since there are no functionality differences between MadGraph and CalcHEP, one would 
expect that KDmad and KDcalc should be the same; therefore this provides a useful cross- 
check. 

We perform such consistency checks on both SM Higgs signal and background samples, 
for DF as well as SF final states. The results are displayed in Fig. [121 which reveals that, 
as expected, KDmad and KDqxi^c are in excellent agreement. Perfect correlation is also 
observed for the signal and background matrix elements as calculated by the two tools. This 
synchronization exercise serves a dual purpose: first, we are able to validate our code, and 
second, the level of agreement seen in Fig. [l3]provides a benchmark for the other comparisons 
in this paper. 



2. Comparison with NLOME 



The MCFM code 58|] can be inverted to compute the matrix element weight |A^P from 
a given final state kinematic configuration. The corresponding code, NLOME js^l, is still 
under development. Here we use a beta version of NLOME to cross-check against our 
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FIG. 13. Comparison of the kinematic discriminant, KDcalc^ computed with CalcHEP, and 
the kinematic discriminant, -RT-DmaDi computed by MadGraph, for SM Higgs signal events (top 
row) and background events (bottom row), using 4e events (left column), 4/i events (right column), 
and 2e2/i events (middle column). 



results from the previous two sections. We should mention that as an NLO tool, NLOME 
involves an integration over the unknown longitudinal momentum of the additional jets, 
which are recoiling against the 4i system. However, here we are using LO events, since we 
are interested in comparing the leading order machinery implemented in the different tools. 
Therefore, we removed the additional longitudinal momentum integration in NLOME and 
we are running the NLOME code in a purely LO mode. 

The comparison between KDuad and -R'-Dnlome for DF M events is shown in Fig. [HI 
The two calculations for 2e2/i final states are on equal footing and their results should agree. 
Indeed, this is what Fig. [14] shows: the level of agreement is excellent and comparable to 
what we observed earlier in Fig. [131 
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FIG. 14. Comparison of the kinematic discriminants i^i^NLOME and XDmad for DF 2e2// events 
for SM Higgs signal (left plot) and background (right plot). 

Appendix B: Notation and conventions 

At leading order, events in the Higgs golden channel are described by 10 degrees of 
freedono. Following 16|, |30[ , we can take them to be as follows: 

• Three invariant mass parameters: M4^, Mz\ and Mz2- 

• The rapidity yzz* of the event in the LAB frame, see Sec. |Vl 

• Two angular variables defined in the CM frame of the whole event. These can be 
chosen to be the polar angle Q* (measured from the beam axis) and the azimuthal 
angle $* of the Z\ system, as shown in Fig. [151 

• Two angular variables defined in the CM frame of the system. These can be taken 
to be the azimuthal angle $i and the polar angle Q\ (measured from the Z\ direction 
in the Higgs CM frame) of the lepton produced in the Z\ decay (refer to Fig. [T5|) . 

• Two angular variables defined in the CM frame of the Z^ system. These can be taken 
to be the azimuthal angle $2 and the polar angle 6*2 (measured from the Z^ direction in 
the Higgs CM frame) of the lepton produced in the Z2 decay. It is often convenient 
to trade the angle $2 for $ = $2 — 



^ There are four particles in the final state, whose momenta give 4 x 3 = 12 parameters, two of which are 
removed by transverse momentum conservation. 
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FIG. 15. Definition of the angular variables relevant to the H — )■ ZZ* M event topology. 



Appendix C: The Matrix Element Kinematic Discriminant package, MEKD 

. The 



The computer code for calculating KDmad can be freely downloaded from 
website also includes instructions and examples for installing the package and running the 
producer, which for completeness are also included here. 



1. Description of the code 

The Matrix Element Kinematic Discriminant, MEKD, package provides tools to calculate 
the leading order (LO) matrix elements for signal gg —> X ^ ZZ* — t- Ai and background 
qq — > ZZ* — i- A£ processes, and to build kinematic discriminants KD that can be used for 
separation between these processes. The supported signal processes include the production of 
a scalar and a spin-2 resonance X through gluon-gluon fusion and its decay into four leptons 
via two Z gauge bosons, where the couplings of the scalar resonance to gauge bosons are 
kept general (parametrized). The MEKD package consists of two parts: 

1. MEKD interface class (and the corresponding MadGraph libraries it depends on) 

2. MEKD producer (macro defined in the runKD_MAD . cc) 
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The MEKD class is declared in MEKD . h and provides an interface to the methods to calculate 
the MEs and KDs for the chosen signal and background processes using the kinematic 



information about the four leptons in the final state. Detailed description o 



68|. 



MEKD interface 



methods can be found at the dedicated MEKD class reference page at 

The macro runKD_MAD.cc is an executable that can run over the input sample of four 
lepton events and as an output produces a file with MEs and KDs for each of the input 
events. Its functionality is based on the MEKD interface class. The MEKD package 
provides the following functionalities: 

1. Accepts as input the tabulated data file with the kinematic information of four leptons 
in the final state. The format of the input file is the following: 

idi id2 ids id^ pi^ piy pi^ d P2x P2y P2z 62 PSx P3y P3z 63 Pix Piy PAz 64 

where idN, Pnx, PNy, Pnz and cat are the PDG id, spatial components and time 
component of the Nth lepton four momentum, respectively. 

2. Allows the user to select the PDFs that should be taken into account in ME calculation. 

3. Initializes the couplings of the resonance according to the user's signal selection. 

4. Performs a purely transverse boost of each event to a reference frame where the trans- 
verse component of total momenta of the four lepton system is zero. 

5. Feeds the boosted momenta of all four leptons to the ME calculator and computes 
MEs and the KD for the given event. 

6. Provides the output as the tabulated data file with the computed ME and KD values. 
The format of the output file is the following: 

\Mzz\' \Mxzz\' KD 

where |A^zzP, \-Mxzz\'^ and KD are the squared ME for the background process, 
the squared ME for the selected signal process and the kinematic discriminant KD, 
respectively. 

7. Prints out the status of the code execution and the summary message at completion. 
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2. User instructions 

a. Requirements 

The latest version of the code can be downloaded and installed following the instructions 
at the MEKD web site: 

\protect\vrule widthOpt\protect\href {http : //mekd. ihepa. uf 1 . edu/}-[http : //mekd. ihepa.uf 1 . e 

Compilation of the code requires an installed gcc compiler. GCC binary packages for multiple 
platform can be found here: 

\protect\vrule widthOpt\protect\href {http : / /gcc . gnu . org}{http : //gcc . gnu . org} 

b. Setup of the MEKD code 

The package can be installed from the downloaded tarball by typing in a terminal window 

tar xvf MEKD_Madgraph.tar 
cd MEKD/macros 
. setup . sh 

The script will compile and link all necessary MEKD libraries/macros. One of the outputs 
is the main executable runKD_MAD. If ROOT framework is installed and the environment 
properly set, the code will be compiled with ROOT support. 

c. Run the MEKD producer 

In a terminal window, type 

./runKD_MAD [-f input_file] [-x x_resonance] [-p pdf .include] [-1 log_file] 

where the available options are described in Table [B 

For example, to run the . dat file provided in the package, execute command: 

./runKD.MAD -f DATA/Events/SIG_41_30evt . dat 

The user can define a resonance with custom couplings by specifying as the type of resonance 
"Custom" and by specifying manually the desired couplings in the file: 
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input_f ile 


name of the input . dat file (string, REQUIRED) 


x_resonance 


choice of the signal resonance (string, DEFAULT = 'SMHiggs') 
Available options: SMHiggs, HiggsOM, Graviton2PM and Custom. 


pdf .include 


name of PDFs, not used if pdf _include=' ' (string, DEFAULT= ' CTEQ6L ' ) 


log_f ile 


name of log file, no logging if log_f ile=' ' (string,DEFAULT= ' ') 



TABLE 1. Options of the MEKD producer. 



src/ Cards/par aiii_card . dat 

Examples of files with couplings parameters can be found in template .dat files located in 
the same directory. 

Options and details on input parameters can always be printed out as: 

./runKD.MAD -h 

d. Output from the MEKD producer 

To run the code with a user-provided input file, yourInputFileName.dat, evaluate 
./runKD_MAD -f yourInputFileName.dat 
and the output will be stored in the . dat file named 
yourInputFileName_withDiscriminator . dat 

It includes the MEs for selected signal and background, as well as the KD value. 

e. Comparison of user MEKD results 

The results of a custom user code which uses the MEKD . h hbraries can be compared to 
the results obtained by the runKD_MAD.cc macro. User code should be run on the input 
. dat files (SIG and BKG) which contain 30 events located here: 

DATA/Events 
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The output files with MEKD results can be compared to the reference output files of the 
"runKD_MAD" macro located in the same directory. 
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